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Abstract 

Funding bodies are increasingly recognizing the need to provide graduates and researchers with access to short 
intensive courses in a variety of disciplines, in order both to improve the general skills base and to provide solid 
foundations on which researchers may build their careers. In response to the development of 'high-throughput biol- 
ogy', the need for training in the field of bioinformatics, in particular, is seeing a resurgence: it has been defined as 
a key priority by many Institutions and research programmes and is now an important component of many grant 
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proposals. Nevertheless, when it comes to planning and preparing to meet such training needs, tension arises be- 
tween the reward structures that predominate in the scientific community which compel individuals to publish or 
perish, and the time that must be devoted to the design, delivery and maintenance of high-quality training materials. 
Conversely, there is much relevant teaching material and training expertise available worldwide that, were it prop- 
erly organized, could be exploited by anyone who needs to provide training or needs to set up a new course. To 
do this, however, the materials would have to be centralized in a database and clearly tagged in relation to target 
audiences, learning objectives, etc. Ideally, they would also be peer reviewed, and easily and efficiently accessible 
for downloading. Here, we present the Bioinformatics Training Network (BTN), a new enterprise that has been 
initiated to address these needs and review it, respectively, to similar initiatives and collections. 

Keywords: Bioinformatics; training; end users; bioinformatics courses; learning bioinformatics 



INTRODUCTION 

As biological data have become increasingly central 
to the life sciences, there is a need both for profes- 
sional bioinformaticians and for 'data savvy' life sci- 
entists v^ho are comfortable accessing, submitting 
and analysing data. Over the past 10 years, higher 
education institutions have begun to offer under- 
graduate and postgraduate courses in bioinformatics 
and computational biology. This is addressing the 
need for professional bioinformaticians but does 
not address the very different needs of the growing 
and diversifying number of life scientists who need to 
be able to access, submit and analyse biological data. 
Bioinformatics training during life science under- 
graduate studies could partially address this need, 
but is not yet universal in life-science curricula. 
The demand for a variety of short, focused courses, 
ranging from introductory to advanced levels, has 
therefore grown, and continues to grow, enormously 
[1]. This growth is also partly related to rapid 
technological advances in experimental techniques, 
especially those that characterize modern 'high- 
throughput biology', and the parallel evolution of 
the bioinformatics tools and resources necessary to 
manage and analyse the vast quantities of data these 
are producing [2]. 



Currently, short courses (typically from 1 to 5 days 
in length) in various bioinformatics topics are 
delivered worldwide. They generally represent a 
compromise between content depth and time, 
where workshops cater to heterogeneous target audi- 
ences and provide focused bioinformatics training in 
relatively short time-frames. Anyone involved in 
organizing and delivering such courses faces common 
(and not trivial) challenges [3]. The Bioinformatics 
Training Network (BTN), a community-led project 
that arose in response to these challenges, aims to 
provide platform supported, pragmatic solutions for 
the exchange of expertise, training materials and 
training experiences. Here, we describe this effort 
and invite anyone involved in bioinformatics training 
to engage with, and contribute to, this community 
effort. 

WHAT CAN YOU DO WITH THE 
BTN? 

The BTN website (http://www.biotnet.org/) is a 
centralized resource for the bioinformatics commu- 
nity to share training materials and techniques, and to 
disseminate awareness and spark new discussions and 
initiatives. The global community is welcome to 
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browse the site, and we encourage, in particular, 
those who are interested or involved in bioinformat- 
ics training to join the BTN. All materials in the 
BTN are made freely available under the Creative 
Commons Attribution-Share Alike 2.5 License 
(http : / / creativecommons.org/licenses/by-sa/ 2.5/). 
The site has been developed by the European 
Bioinformatics Institute (EMBL-EBI http: //www 
.ebi.ac.uk/), using the Open Source content man- 
agement system Drupal (http://drupal.org/). To fa- 
cilitate searching, a key feature of the site is the use of 
tag words. These are currently added as free text 
when users upload their materials, but ultimately 
will be compiled into a controlled vocabulary 
reflecting the most frequently used terminology 
and using available established ontologies when ap- 
propriate. All terms used as tags are searchable, allow- 
ing users to sort information at all levels. This 
facilitates efficient exploration, leading users more 
rapidly to relevant information (e.g. tag 'PhD': by 
clicking on the term PhD, users obtain details of all 
trainers registered with the site who provide training 
for PhD students) . The five main areas of the BTN 
website are described in detail below. 

Trainers and organizers 

This section lists individuals who have registered to 
contribute to the site and are involved in bioinfor- 
matics training; this includes not only those deliver- 
ing training sessions directly, but also those involved 
in the organization and planning of such training. 
The rationale behind this is 2-fold: first, to build up 
a community in which trainers can identify each 
other and share areas of expertise, and in which 
they can liaise with, and get help from, other trainers; 
second, to function as a hub for course organizers to 
identify potential trainers for specific subjects and 
levels. 

Training materials 

This section is the main focus of the BTN site. This 
section is a direct response to the often-voiced desire 
to have a common repository of well-documented, 
open-source training materials. It is specifically in- 
tended to help bioinformatics trainers to share, adapt 
and develop materials, and to make them freely 
available to the community at large. It is hoped 
that both having a centralized repository and foster- 
ing a culture of sharing will help new ideas and best 
practices to be readily and rapidly disseminated. To 
this end, trainers have the opportunity to rank each 



other's materials, in a similar way to the Amazon 
ranking system. This includes a comments box to 
allow trainers to state exactly what was rated and 
the reason for the given rankings. This functionality 
aims to stimulate constructive feedback and, conse- 
quently, to help improve the quality and timeliness 
of materials available from the site (Figure 1). 

Training materials are grouped into 10 major cate- 
gories, according to the types of data handled by 
different tools and resources: Genes and Genomes; 
Proteins and Proteomes; RNAs and Transcrip tomes; 
Interactions and Pathways; Small molecules and 
Metabolomes; Ontologies; Literature and Text 
mining; Programming and Operating Systems; 
Biostatistics; and Other. Where appropriate, mater- 
ials can be tagged to more than one section 
(e.g. 'drug discovery' is relevant to both proteins 
and small molecules). These categories have been 
derived from extensive discussions among the con- 
tributing authors. However, as the site grows and the 
community enlarges, the categories will doubtless 
evolve to reflect wider community needs and inter- 
ests. Special care will be taken to monitor the con- 
tent of the 'Other' section, to avoid this growing 
amorphously, and making it more and more difficult 
to find relevant materials within it. 

Anyone can readily access and download materials 
from the BTN site without a requirement for regis- 
tration. However, to upload information, registra- 
tion is necessary. To register, contributors need 
only provide a brief motivation: this is required in 
order to ensure that requests are genuine. Apart from 
a textual description, uploaded materials can include 
files with any of the following extensions: ppt, pptx, 
doc, docx, xls, xlsx, pdf, txt. Other formats 
(e.g. videos) can be incorporated as external hyper- 
links. At present, the maximum file size allowed 
per upload is 64 MB. 

While aiming to provide a comprehensive reposi- 
tory of training materials, the BTN nevertheless 
wishes to avoid redundancy. Therefore, for those 
who want to share materials but already have a 
location from which these are available, the BTN 
site offers the ability simply to add a link to remote 
material rather than requiring them to be uploaded 
again. The materials, links and classifications (espe- 
cially the 'Other' section) will be reviewed period- 
ically by appointed BTN members to provide the 
greatest functionality to the community, while re- 
taining system security and ensuring that the BTN 
works within system capacities. 



386 



Schneider et al. 



t n F o r m a tics 



Home Ti-a I ne rs * Organisers 



Sftarch 



Starch this sftei 



Contact Us 



Log in Rtgirtaf 



T-alning Facilities Courses & Events Useful Links About Us 



TRMNfNG Materials 



Document Categories 








Gint i Gencimif 


Protein i 




Interactioni & 


Sfratl molecules 4 


Ontotoal «s 


Litarsture fiTesrt 
mining 


Pronramminc S. 


Bi Oft atl sties 


Otltar 



Most Recently Updated 



litis 


Updated 


Introduction to Bioovthon 


4^ar^ 5 hours 


Mioroarriv Ditj AtjI^Is 




Ertserrtil [yresentjtloiis and 
ccuEe booklet 


ago 






R Basics 


ago 



Most Recently Rated 


Title 


Rati ng 


Hislorv ct bioinfofmatitfs 




1 ntrodu dio n io B iopvtho n 




Introduction to protein sequence 




jnalysis 




Pvthcft croarammlna tor bsomnets 




- handouts and exetetees 


EMBL-EBl refourcQS and tools quidk 




list 





O2aij0-2011 Biofnforrratjcs Training Netwoitt. All Right R«sanrad. 

Figure I: Screen dump of one of the BTN web pages showing the section about "Training Materials" containing a 
top level list of topics (categories), a list of the most recently updated materials (on the left) and a list of the most 
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Finally, to minimize the risk that the BTN accu- 
mulates dysfunctional or outdated materials, auto- 
matically generated e-mails will be sent to the 
submitter 2 years after the submission date asking 
them to either confirm that their materials are still 
valid and useful or to agree to the deletion of their 
materials from the site. 

Training facilities 

By sharing our own trainer experiences, it emerged 
how challenging it can be to get a quick overview of 
available IT-training facilities and appropriate venues 
(worldwide) . This section was created to allow course 
organizers and providers to list facilities suitable 
for hosting bioinformatics training events. At the 
moment, this is a simple list containing the venue 
name, location, a brief description, and the relevant 



contact details. In the future, as the community 
grows, a map view will also be developed to allow 
rapid identification of suitable venues, together with 
the contact details of the responsible organization/ 
institute. 

Courses and events 

Raising awareness of training events is often a sig- 
nificant task for course organizers. Although the pri- 
mary intention of the BTN site is to address the 
needs of trainers and course organizers, it emerged 
from our discussions that being able to identify other 
courses in the same field would allow potential train- 
ees to locate courses running elsewhere if, for 
example, a course of interest was fully booked/ 
cancelled, or if his/her expectations were not 
matched by the learning objectives of the chosen 
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course. This section is not intended to be a holding 
place for event websites. Instead, it will act as a 
bill-board for upcoming events, listing a minimal 
set of information to allow users to decide whether 
particular events are relevant to them, plus a hyper- 
link to the actual event website. This could be useful 
if, for example, during a course, trainers could point 
to the BTN as the place where trainees could access 
all the materials, as any uploaded materials could be 
searched for using the relevant course-name tag. This 
would have the additional effect of providing indir- 
ect access to the course for remote trainers, allowing 
them to see what type of material is being taught and 
where. At present, the list of events can be sorted by 
title, venue or date. Ultimately, the aim is also to 
integrate this into a calendar view option. 

Useful links 

This section lists the URLs of bioinformatics training 
providers, promotes awareness and allows informal 
networking and exchange of experiences. It also rep- 
resents a platform to discuss new ideas and initiate 
white papers on best practices. 

BIOINFORMATICS TRAINING: 
WHAT IS OUT THERE AND HOW 
THE BTN FITS 

There are several initiatives to make bioinformatics 
course materials available. We review these here and 
summarize how the BTN differs from these initia- 
tives. Our analysis reveals considerable variation in 
the functionalities provided by these sites, and in 
the maintenance and updating mechanisms for train- 
ing materials. The major challenges that we face 
when looking for bioinformatics training materials 
prompted us to look for the following characteristics: 

(i) Accessibility: is it possible to download the ma- 
terials without logins or fees? 

(ii) Date stamp: is it easy to work out when the 
materials were created and updated? 

(iii) Peer review: are the materials reviewed or rated 
by other trainers? 

(iv) Search functionality: how easy is it to retrieve 
training materials relevant to your specific area 
of interest? 

(v) Trainer/contact information: can I find out who 
developed the materials and get in touch with 
them? 

(vi) Purpose: when relevant, is it easy to link the ma- 
terials to the course (s) in which they were used? 



(vii) Community engagement: can I upload my own 
training materials and make them available for 
other trainers to use? 

Table 1 lists the online bioinformatics training re- 
sources that we reviewed. Some repositories might 
be described as interactive e-learning web portals than 
repositories of materials, but as these are also used by 
trainers to source training materials, we felt that it 
was appropriate to include them. When reviewing 
these sites, large numbers of dead links were encoun- 
tered and often it was difficult to access information 
about when was the last content updated. 

In light of these diverse endeavours, the BTN aims 
to provide a complementary, cross-institute portal 
for identifying training materials, and to help share 
and promote the recognition of, and feedback to, 
such institutional initiatives. It also provides an op- 
portunity to broadcast enterprises relevant to individ- 
uals involved in bioinformatics training including 
those devoted to education (which we define here 
as longer courses, such as Master's or undergraduate 
programmes), journal and publication opportunities 
in which bioinformatics training materials are the 
focus (e.g. tutorials), and funding opportunities for 
bioinformatics education. The BTN targets trainers 
(in contrast to other training repositories) by lifting 
duplication of effort from their shoulders. If con- 
structive collaborations can be built among the 
BTN's community, then projects may ultimately 
evolve whose whole is greater than the sum of 
their parts; this philosophy is the foundation on 
which the BTN is built. 



WHAT HAPPENS NEXT: TOWARDS 
AN OPEN, FREE AND 
COLLABORATIVE EVOLUTION 

The BTN website is active at www.biotnet.org. It 
differs from other repositories of training materials in 
that its focus is on trainers, rather than trainees. 
Other functionalities unique to the BTN include the 
ability to review the materials and rank them accord- 
ing to a star-rating system. We invite all those 
involved in bioinformatics training either to register 
and join, or simply to browse and provide feedback, 
so that the site can be improved and can evolve new 
functionalities. By openly and freely sharing our bio- 
informatics training content, wisdom and experi- 
ences, we can meet the growing training needs and 
challenges of present research environment. 
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Key Points 

• The Bioinformatics Training Network (BTN) is a global 
community-led project aimed to provide pragmatic solutions 
for the exchange of expertise, training materials and training 
experiences. 

• Constructive collaborations can be built by sharing experiences 
also at the level of Bioinformatics training. All materials in the 
BTN are made freely available under the Creative Commons 
sharealike license. 

• A free centralized repository and registry of training materials 
(and related Bioinformatics information) ensures efficient use 
and re-use of efforts made across the globe as well as a direct 
vessel to feedback and improve such resources. 

• The BTN also provides an opportunity to broadcast enterprises 
relevant to individuals involved in bioinformatics training, includ- 
ing education, journal and publication opportunities in which 
bioinformatics training materials are the focus as well as funding 
opportunities for bioinformatics education. 
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